CONTRACTING ELEMENTS AND RANDOM WALKS 



ALESSANDRO SISTO 

Abstract. We define a new notion of contracting element of a group and 
we show that contracting elements coincide with hyperbolic elements in rela- 
tively hyperbolic groups, pseudo-Anosovs in mapping class groups, rank one 
isometries in groups acting properly on proper CAT(0) spaces, elements acting 
hyperbolically on the Bass-Serre tree in graph manifold groups. We also define 
a related notion of weakly contracting element, and show that those coincide 
with hyperbolic elements in groups acting acylindrically on hyperbolic spaces 
and with iwips in Out(F n ), n > 3. We prove that any simple random walk in 
a non-elementary finitely generated subgroup containing a (weakly) contract- 
ing element ends up in a non-(weakly-)contracting element with exponentially 
decaying probability. Also, we show that each (weakly) contracting element is 
contained in a hyperbolically embedded elementary subgroup. 



1. Introduction 

A Morse element in a group G is an element h such that H = (h) is undistorted 
in G and any quasi-geodesic with endpoints on H stays within bounded distance 
from H, and the bound depends on the quasi-isometry constants only. Examples of 
Morse elements include infinite order elements in hyperbolic groups |Gr) , hyperbolic 
elements (of infinite order) in relatively hyperbolic groups |DS] , pseudo-Anosovs in 
mapping class groups |Bej , etc. See |DMSj for further details and examples. Indeed, 
in all mentioned cases a stronger property than being Morse holds. Namely, all 
elements are contracting with respect to a suitable collection, called path system, of 
quasi-geodesic paths in the respective groups. Roughly speaking, the main property 
that a contracting element g satisfies is the existence of a map n g from the group 
onto (g) such that if two points x, y have far away projections then all special paths 
from x to y pass close to TT g (x) and TT g (y). (This definition is less general than 
the one we will give, which is stated in terms of group actions on a metric space.) 
Related properties have been considered in [BFull IBel IA-K1 ISilj . These are the 
examples we can provide, see Section [3] 

Theorem 1.1. Let G be a relatively hyperbolic group (resp. mapping class group, 
group acting properly by isometries on a proper CAT(0) space, graph manifold 
group). Then there exists a path system for G such that g 6 G is contracting if and 
only if g is hyperbolic (resp. is pseudo-Anosov, acts as a rank one isometry, acts 
hyperbolically on the Bass-Serre tree). 

We also define more general notions, that of being weakly contracting, which will 
be sufficient for our applications. 



Theorem 1.2 (Proposition 3.7 Proposition 3.8). Let G be a group acting acylin- 
drically on a hyperbolic space X, and let VS be the collection of all geodesies of X. 
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Then g € G is weakly contracting for the weak path system (X, VS) if and only if 
it acts hyperbolically on X. 

There exists a weak path system for Out(F n ), n > 3, such that g £ Out(F n ) is 
weakly contracting if and only if it is iwip. 

The disadvantage of the weak version of contractivity is that we were not able to 
show that weakly contracting elements are Morse (this is related to (DGOl Problem 
9.4]). 

We will treat all examples mentioned in Theorems and 



1.2 



from a common 



perspective. A recent paper by Dahmani, Guirardel and Osin [DGO has essentially 
the same aim. So, first of all, we will clarify the relation between the notion 
of contracting element and that of hyperbolically embedded subgroup defined in 
[DGO] : 



Theorem 1.3 (Theorem 5.6 I. Each weakly contracting element g is contained in 



a hyperbolically embedded elementary subgroup E{g). 

The subgroup E(g) is defined in a natural way in terms of projections, see 
Corollary |5.4| Notice that in view of Theorem |1.1| we obtain new examples of 
hyperbolically embedded subgroups, i.e. E(g) where g acts as a rank one isometry 
on a proper CAT(0) space, solving [DGOl Problem 9.5]. 

Our main result is that (weakly) contracting elements are generic, in the sense 
specified below. Let G be a group and S C G a finite subset. Let W n (S) be the 
set of words of length n in the elements of S and their inverses. A simple random 
walk supported on (S) is a sequence of random variables {A„} taking values in G 
and with laws \i n so that for each g € G and n € N we have 

\{ w £ W n (S) : w represents g}\ 

Mn({5}) = msj\ ■ 

Theorem 1.4. Let G be a group equipped with a path system (resp. weak path 
system) and let H < G be a non- elementary finitely generated subgroup containing 
a contracting (resp. weakly contracting) element. Then the probability that a simple 
random walk supported on H gives rise to a non- contracting (resp. non-weakly- 
contracting) element decays exponentially in the length of the random walk. 

Roughly speaking, the theorem states that the probability that a long word writ- 
ten down choosing randomly generators of H represents a non-contracting element 
is very small, and in fact exponentially decaying in the length of the word. 

For mapping class groups this was known already |Ma2] , and related results can 
be found in [Rill IRi2l IMall IMS] . We emphasize that we can give a self-contained 
proof of the theorem above modulo Lemma |6.3[ 



Outline. Section 2 contains the definitions we will use throughout. Section 3 is 
devoted to the proof of Theorems 1 1 . 1 1 and 1.2 Section 4 contains our main technical 



tool (Lemma 4.1 ), which gives us a way of constructing many contracting elements 
once we are given just one of them. In Section 5 we will show Theorem |1.3[ and 
finally in Section 6 we will show our main result. In the Appendix we will sketch 



the proof of a "Weak Tits Alternative" not relying on Theorem 1.3 and DGOJ. 



Acknowledgments. The author would like to thank Cornelia Dru^u and Denis 
Osin for helpful conversations and suggestions and Ric Wade for clarifications on 
Out(F n ). 
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2. Definitions and first properties 

Let A be a metric space. 

Definition 2.1. A path system VS in A is a collection of (/i, /x)-quasi-geodesics in 
X, for some /i, such that 

(1) any subpath of a path in VS is in VS, 

(2) all pairs of points in X can be connected by a path in VS. 

Elements of VS will be called VS— special paths, or simply special paths if there 
is no ambiguity on VS . 

Fix a path system VS on the metric space X. 

Definition 2.2. A subset A C X (usually a VS— special path) will be called 
VS— contracting with constant C if there exists tt a — tt : X —¥ A such that 

(1) d(ir(x),x)) < C for each x £ A, 

(2) for each x,y G A, if d(7r(x), n(y)) > C then for any VS— special path 6 
from x to y we have 7r(x)), d(S, ir(y)) < C. 

The map tt will be called VS— projection on A with constant C. 



T x 




y 












y tt(x) J 






(y) J 




Figure 1 







We point out a relation with properties that appeared in the literature in several 
contexts, see [BFuTl lA^Kl [SIT] . 

Lemma 2.3. |Sil[ Lemma 4.24] Let X be a geodesic metric space and A C X. 
Denote by VS the collection of all geodesies in X . Suppose that the map tt : X — > A 
satisfies the following properties for some C: 

• d(x, tt{x)) < d(x, A) + C for each x 6 X, 

• diam(Tr(Bd(x))) < C for each x £ X , where d = d{x, A). 
Then tt is a VS— projection with constant depending on C only. 

The following lemma will be used several times. 

Lemma 2.4. Let tt be a VS— projection with constant C on A C A . Then 

(1) whenever 8 is a special path we have diam(Tr(5)) < diam{8 PI Nc{A)) + 2C 
and more specifically diam(Tr(8)) <C if 5 n Nc(A) = 0. 

Also, there exists k = k(VS) such that 

(2) for each x,y £ X, d{ir{x), 7r(y)) < kd(x, y) + k, 

(3) for each x G A we have diam{Tr{B r {x)) < C for r — d(x,A)/k — k, 

(4) for each x € A, d(x, tt(x)) < kd(x, A) + k. 
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Proof. Item 1) is clear from the second projection property. 

Item 2) follows from special paths being quasi-geodesics and the fact that either 
d(Tt{x), 7r(y)) < C or any special path from x to y passes C— close to Tr(x),ir(y). 

Item 3) holds in view of item 1) because, for c large enough, for each y G B r (x) 
there is a special path connecting x to y and not intersecting Nc{A). 

In order to show item 4), consider a special path 7 from x to some y G A with 
d(x, y) < d(x, A) + 1. If d{y, ir(x)) < C, we are done. Otherwise, 7 contains a point 
x' such that d(x' ^(x)) < C. In particular, 

d(x, n(x)) < d(x, x ) + C < ^d(x, y) + /1 + C < fj,d(x, A) + fi + C + 1, 

and we are done for c large enough. □ 

Lemma 2.5. If k is as in the previous lemma, A, B C X are V 'S contracting 
subsets with constant C and x G X , then 

mm{d(Tr A {x),TrA(B)),d(Tr B (x),7rB(A))} <kC + k + C. 

Proof. If max{d(7r J 4(x), tta(B)), d(n B (x), ir B (A))} < kC + k + C there is nothing to 
prove, so, up to swapping A and B, suppose that d(-K a(x) , it a{B)) > kC + k + C. 
Let S be a special path from x to some x' € B, and let p be the first point in 
S n Nc{A). If we show that there is no point q £ S n Nc(B) before p, we are 
done by Lemma [2~4| -(4) — (1). In fact, if there was such a point q, again by 



Lemma 2.4 -(1) — (4) we would have 

d(TT A (x),TT A (B)) < d(ir A (x),TT A (q)) + d(ir A {q),TT A (B)) <C + kC + k. 

a 

We are ready for our main definitions. 

Definition 2.6. Let G be a group. A path system (X,VS) on the group G is a 
proper action of G on the metric space X preserving the path system VS on X. 

An infinite order element g of G will be called VS— contracting if for some (and 
hence any) xq £ X and any integer n 

(1) the orbit of xo is a quasi-geodesic, 

(2) all special paths connecting xo to g n (xo) are VS— contracting with uniform 
constant. 

Recall from jBFulj that, given a group G acting on the metric space A, the 
element g € G is WPD if the orbits of (g) are Morse quasi-geodesics and for every 
R > and x £ X there exists N = N(R, x) such that 

\{h G G : d(x,h(x)) < R,d(hg N x,g N x) < R}\ < +00. 

The action of G on X will be said to be WPD if every g G G with Morse quasi- 
geodesic orbits is WPD. (This is not standard terminology as WPD actions are 
usually implied to be actions on hyperbolic spaces.) 

We will need the following consequence of the WPD property. 

Lemma 2.7. (cf. BFul, Proposition 6— (2)]) Suppose that G acts on X and g G G 
is WPD. Then for each x G A, R > there exists L so that 

{heG : d(x, hx) < i?, diam(N R (h(g)x) n (g)x) > L} 

is a finite set. 
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The meaning of the lemma is illustrated in Figure [2] Roughly speaking, the 
definition of WPD gives us that there are finitely many elements h so that for some 
large /V we have the situation depicted for N' = TV. The lemma gives us the same 
condition for all large enough TV and without the requirement N' = TV. 




d(x/g N x) 




7 




1 \ 


N \\ 

f x) -V 


27?' 


















N 

9 x j 



Figure 2 



Figure 3 



Proof. Fix x,R,g. First of all notice that, as (g)x is Morse, there exists R' > R 
depending on R (and the data we fixed) so that if d(x, hx) < R, d(g N x, h(g)x) < R 
then for each < n < TV we have d(g n x, h(g)x) < R' . We claim that it suffices to 
show that if 

d(x, hx) < R, d(g N x, hg N 'x) < R 1 (*) 
for some TV, TV' then either d(g N x : hg N x) < C{R) or d{g N x,hg- N x) < C{R), for 
some constant C(R) > R. In fact, if h, h' are as in the second case then hh' is easily 
seen to satisfy d(hh'x,x) < 2R, d(g N x, hh' g N x) < 2C(R). So, the cardinality of 
the set as in the statement is at most the cardinality of 

{heG: d(x, hx) < 2R, d(hg N x, g N x) < 2C(R)}, 

where for L large enough we can choose N = N(C(R)). By definition of N(C(R)) 
this set is finite, as required. 

Now, suppose that (*) holds. Then we have 

\d(g N 'x,x)-d{g N x,x)\ < 2R' . 

So, we just need to show that if N' satisfies this condition then |7V — 7V'| < K(R). 
In order to show this notice that, as the orbits of g are Morse quasi-geodesics, 
there exists a geodesic 7 so that d(g N , 7), d(g N , 7), d(x, 7) < D (where D does not 
depend on 7V, 7V 7 ), see Figure [3j In particular, if TV' satisfies the condition then it 
is contained in a ball of radius, say, 2R' + 1073 around g N x or in a ball of the same 
radius around g~ N x. The existence of K(R) then follows from the orbit of g being 
a quasi-geodesic. □ 

Definition 2.8. Let G be a group. A weak path system [X,VS) on the group G 
is a WPD action of G on the metric space X which preserves the path system VS 
on X. 

An infinite order element g of G will be called weakly VS— contracting if for 
some (and hence any) xq £ X and any integer n 
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(1) the orbit of xq is a quasi-geodesic, 

(2) all special paths connecting xq to g n (xo) are VS~ contracting with uniform 
constant. 

Remark 2.9. A proper action is WPD, and hence contracting elements are weakly 
contracting. 

Lemma 2.10. 

(1) Special paths which are VS— contracting with constant C are Morse with 
constants depending on C only. 

(2) Contracting elements are Morse. 

Proof. The first part can be proven in the same (actually, slightly simpler) way as 
the second part, so we will spell out the proof for the second part only. 

Let G, X, VS, xq be as in the definition above, and let g be a contracting element. 
The fact that (g)xo is a quasi-geodesic easily implies that (g) is a quasi-geodesic in 
G. 

In order to show that g is Morse we now need to show the following. Let a be a 
(/i, c)— quasi-geodesic in G connecting e to g n and let 7 be a special path from Xq 
to g n XQ. Then for each h € a we have d(xo, hxo) < K, where K depends on g, p, c 
and the action of G. A constant depending on the said data will be referred to as 
universal. 

Let 7r be a projection on 7 with constant C . Increase C suitably and define 
7r' : G — » (g) in such a way that d{n' {x)x , n(x)) is bounded by C. Let p be a 
universal constant such that 

d{it (h), 7T (h )) < pd(w(hxo), ttQi xq)) + p 

for each h, h! € G. Let r £ N be the least integer so that d(a(j),a(j + r)) > 
p 2 (pC + 1) + c for each integer j such that j, j + r are in the domain of a. Whenever 
h — a(j), h! — a(j + r) we will say that h, h! are consecutive. Notice that there is 
a universal bound L on d(hxo,h'xo) whenever h,h' £ a are consecutive. Suppose 
that ho, ■ ■ ■ , h n <E a is a maximal chain of consecutive points such that d(hxo, 7) > 



kL + k , for k as in Lemma 2.4 Notice that we can bound the distance of ho, h n 



from 7 again by a universal constant. We wish to show that n cannot be arbitrarily 



large. In fact, by Lemma 2.4-(3) we have d(ir(hi(xo)),Tr(hi+i(xo))) < C, and 
hence d(n(ho(xo)), n(h n (xo))) < nC so that d{ir'{ho), ir'(h n )) < npC + p. For n 
large enough and in view of d(hi, h i+ i) > p 2 (pC + 1) + c we get 

d(ho, h n ) > nr/p - c > } (d(hj, h i+ i) - c)/p 2 - c> 

npC + p + d(h ,TT'(h )) + d(h n ,Ti'(h n )) > d(h Q , h n ), 

which is a contradiction. We can bound d(p, 7) in terms of n,L,r, so we are 
done. □ 



Remark 2.11. In view of the first part of the lemma in order to show that an 
element g is contracting it is enough to find C and a contracting special path j n 
with constant C connecting xq to g n Xo for each n. In fact, once this is done, any 
special path connecting xo to g n Xo will stay within controlled Hausdorff distance 
from 7„. 
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3. Examples 

3.1. Relatively hyperbolic group. Relatively hyperbolic groups were first in- 
troduced in |Grj as a generalization of hyperbolic groups and have several different 
characterizations |Boll IFal IDal lYal IQslI IDS! IDrj . There is also a characterization 
jSil] in terms of projections on the peripheral subgroups, which is the one we will 
use. 

Let G be a group hyperbolic relative to subgroups Hi, ... , H n . Let T(G) be a 
Cayley graph of G with respect to a finite system of generators containing generators 
for each Hi. We will denote by T(Hi) the subgraph of T(G) corresponding to Hi. 
Consider disjoint copies of r(if,) x K, one for each left coset of Hi in G, and glue to 
each gT(Hi) the corresponding copy of r(ifj) x R by identifying T(Hi) x {0} with 
r(-ffj). Denote the resulting space by X = X(G) and notice that there is a natural 
action of G on X by isometries. Let VS be the collection of all geodesies of X. We 
will say that an infinite order element of G is hyperbolic if it not conjugate into 
any Hi. 

Proposition 3.1. In the notation above, an element of G is contracting for the 
path system (X, VS) if and only if it is hyperbolic. 

Proof. First of all, X is asymptotically tree-graded (i.e. metrically relatively hy- 
perbolic) with respect to the glued copies of r(JJj) x M. This can be easily proven 
using the definition of asymptotic tree-gradedness in terms of asymptotic cones 
[DSj . Alternatively, one can apply either |MR] or (Sill Thereom 0.2] (see the proof 
of [Sill Corollary 5.31]). Recall that any hyperbolic element g is contained in an 
elementary subgroup E(g) so that G is hyperbolic relative to Hi, . . . , H n , E(g), see 
[Os2j . In particular, for the same reasons as above, given any hyperbolic element g 
we can add E(g)xo to the collection of peripheral subsets of X, for some xq € X. 
By the definition of relative hyperbolicity given in |Sil] there exists a projection on 
E(g)xo, and hence on all geodesies connecting g~ n xo to g n XQ. 

Finally, observe that thanks to the "thickening" of the peripheral subgroups, any 
non-hyperbolic element is clearly not contracting. □ 

3.2. Mapping class group. In this subsection we will assume familiarity with the 
notion of curve complex, marking complex and hierarchy paths. We will use results 
from (MM11IMM2] . see also [Bo2] . 

Proposition 3.2. Let A4(S) be the marking complex of the closed orientable punc- 
tured surface S and let T-L(S) be the collection of all subpaths of hierarchy paths. An 
element of MCG(S) is contracting for the path system (M(S),'H(S)) if and only 
if it is pseudo-Anosov. 

Remark 3.3. We remark that in view of the proof of |BFul[ Proposition 8.1] (see 



the sixth to last line) and Lemma 2.3 the collection of all geodesies in Teichmiiller 
space endowed with the Weil-Peterson metric has the property that all axes of 
pseudo-Anosov elements are contracting. 

It is well known that hierarchy paths are quasi-geodesics with uniform constant, 
and that any non-pseudo-Anosov element is not Morse as it has infinite index in 
its centralizer. 

Recall from |Bej that a pair of points (fii^fi^) € A4(S) is D— transverse if for 
each Z C S we have dc(z)(Mi>M2) < D. The proposition follows directly from the 
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lemma below and the fact that pairs of points lying on the orbit of a pseudo-Anosov 
element are D— transverse. 

Lemma 3.4. For each D— transverse pair p\,p 2 each hierarchy path [^1,^2] is 
H(S) — contracting with constant C = C(D). 

Proof. Let B be such that if x, y lie on the (subpath of a) hierarchy path [p, q] with 
main geodesic H then 

(1) for each subsurface Y C S we have dc(Y){ x ,y) < dy{p 7 q) + B, 

(2) d c{s) (x,H)<B, 

(3) for any 7 € H there exists z such that dc(s)( x >H) < B. 

Define 7r : M(S) — > [/ii,/i 2 ] as follows. Consider fj, e .M(S) and let 7^ = 
tts(h)- Pick G H (the main geodesic for the given hierarchy path) such that 
dc(s){lii,^ti) = d C (S){ln,H). Finally choose v £ [/ii,/x 2 ] in such a way that 7r s (^) 
is B— close to a M and set 7r(/i) = za 

Pick any 1/1,1/2 € A4(5). Suppose d(n(ui),n(u 2 )) > C(D), where C(D) will be 
determined later. Notice that dc(s)(ot^ 1 ,a„ 2 ) is bounded from below by a linear 
function of C(D), by the distance formula and D— transversality, so that for C(D) 
large enough we can assume dc(s)( a v 1 ,ctv 2 ) ^ 1005, where S is the hyperbolicity 
constant of C(S). Consider a hierarchy path \v\,v 2 \ (more precisely containing 
v\,v 2 , but this does not affect what follows) and let H' be the main geodesic. 
We have that H' contains points /?i,/3 2 which are B— close to the projections of 
1/1,1/2 e \v\,v 2 \ on (7(5*) and such that dc(s)( a vi, Pi) < 105. 




Figure 4 

Our aim is to show that any point [i on [f'l,^] such that 20(5 + 2B + 10 < 
dc(S){ y i Pi) < 20<5+3£>+20 has the property that 7r(^i)) < C(D) (an analogous 
property holds for v 2 ). 

In order to show this, we will now analyze the K— large domains for the pair 
u[, v 2 , where K > 3(D + B) (a L— large domain for a pair of markings pi,p 2 is a 
subsurface Y C S so that dc(y)(/>ii P2) > £)■ First, we claim that there are no 
common (K/3)— large domains for the pairs p!\,v'i and p,' 2 ,v 2 , where p[ € [/ii,^] 
projects B— close to a Ui . 

In fact, any (K/3)— large domain X, for the pair /ij, ^ is contained in some S\ji, 
where 7$ is a simple closed curve appearing in a hierarchy path connecting irs(lA) 
and z^. This implies that X\ 7^ X 2 as dc"(S) (7i>72) > 3 for C{D) large enough. 

We can now use the fact that there are no D— large domains for the pair p! X i\j! 2 . 
Suppose dcvY){y'\i v '2) ^ f° r some subsurface Y C S. Then (at least) one of the 
following must hold: dc(Y){ l/ 'u Mi) > ^/3, ^c(f)(m'h M 2 ) — or ^(^(KjMi) ^ 



CONTRACTING ELEMENTS AND RANDOM WALKS 



9 



K/3. However, the second inequality does not hold by hypothesis. Hence, any 
K— large domain for v'^v'^ and so any (K + B)— large domain for any pair of 
points on [i/^, v' 2 ], is a (K/3) — large domain for fi'^, v' x or /z 2 , v' 2 (but not both). 

With a similar argument we get that for each v £ [i^, 1/3] and each \i £ [//i,/^] 
all (4JT/3 + D + 2B)— large domains Y C 5 for /1, are (K/3)— large domains for 
^ or // 2 , !/ 2 . Choosing v as above we have that irs(v) is far enough from fix and 
/?2 to guarantee that no (4K/3 + D + 2B) — large domain 7CS for ^, \i is a if— large 
domain for fj^, 1^, where [i £ [/iiiMal ^ s sucn that 20<5 + 2£> + 10 < d C (s)(t L , Mi) < 
20(5 + 3-B + 20. By the distance formula n[) can be bounded in terms of S and 
-B, so we are done. □ 

3.3. Graph manifolds. A graph manifold is a compact connected 3-manifold 
(possibly with boundary) which admits a decomposition into Seifert fibred surfaces, 
when cut along a collection of embedded tori and/or Klein bottles. In particular 
a graph manifold is a 3-manifold whose geometric decomposition admits no hyper- 
bolic part. 

Let M be a graph manifold. It is known |MKL| that its universal cover M is 
bilipschitz equivalent to the universal cover N of a flip graph manifold N, that is 
to say a graph manifold with some special properties (among which a metric of 
nonpositive curvature). We will not need the exact definition of such manifolds, 
but we need to know that we can choose a bilipschitz equivalence (j> : M —> N that 
preserves the geometric components. In |Si2], a family of paths VS(N) in N has 
been defined, and those paths satisfy the following: 

Lemma 3.5. 

(1) All paths inVS(N) are bilipschitz, with controlled constant; 

(2) any subpath of a path in VS(N) is again in VS(N); 

(3) if for i = 1,2 Oj is a special path connecting some point in X w . to some 
point in X w i and the vertex v 

• lies on the geodesic connecting Wi,w' i; and 

• d(v,Wi),d(v,w[) > 2, 
then ai n X v = a 2 fl X v . 

Let VS(M) = {g<t)- l (-i) : g G tti(M),7 e VS(N)}. 

Proposition 3.6. (M ) VS(M)) is a path system for ir 1 (M) . An element of -K\(M) 
is contracting if and only if it acts hyperbolically on the Bass-Serre tree of M . 

Proof. The first part follows directly from Lemma [3.5[ -(1) — (2), so we can focus 
on the second part. 

Let us prove the "if" part. Let g £ %i(M) be an element acting hyperbolically 
on the Bass-Serre tree. Let xq £ X v for some vertex v of the Bass-Serre tree such 
that {g l v} is contained in a bi-infmite geodesic. For each n let 7„ be a special path 
from Xq to g n (xo) and define 7r„ : M — > in the following way. Let 7r^ be the 
projection in the Bass-Serre tree on [v, g n v], and define v(x) for each x £ M to be a 
vertex in the Bass-Serre tree such that x £ X v r x y Also, let 7r"(x) £ {v, . . . , g n v} so 
that d(ir'(x), tt"(x)) < d(v, gv)/2. Finally, define ir n (x) to be a point in X^u^ H7„. 

Now, suppose that we have x,y £ M such that d(n n (x) , ir n (y)) is large enough. 
Then we can conclude that, say, d(n'(v(x)),n'(v(y))) > 10. We will now use Lemma 
|3.5| and the fact that cj> preserves the geometric components to find a bound on 
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the distance between any special path 8 connecting x,y and Tr n (x), ir n (y). By 
hypothesis, <f> induces a simplicial isomorphism from the Bass-Serre tree of M to 
that of N, which we will still denote by <f>. Suppose that 8 = h(j> (a), j n = 



h n (p (a n ). Then it is clear from Lemma 3.5 that 8 shares a subpath with any 
special path h(j)~ l (f3) such that j3 connects some point in X^(u-xt v f x \\\ to some 
point in X ( f > !f l - 1 (v(y)))- More precisely, those paths coincide in X w whenever w is 
"well within" [ir' (v(x)) , it' (v(y))]. We can choose j3 so that the endpoints of /i -1 (/>(/3) 
are Xo,g n Xo, and then the fact that j n is Morse with constant not depending on n 
|MKL] clearly implies the desired bound. 

The "only if" part is easy. In fact, if g £ tti(M) is conjugate to some element 
in a vertex group, then it is clearly not even Morse in view of the fact that such 
groups are virtually products of a free group and Z. □ 

3.4. Groups acting acylindrically on hyperbolic spaces and Out(F n ). Recall 
that a group G is said to act acylindrically on the metric space X if for all d there 
exist Rd, Nd so that whenever x, y £ X satisfy d(x, y) > Rd we have 

\{g&G: d(x,gx) < d 7 d{y,gy) < d}\ < N d . 

Graph manifold groups act acylindrically on their Bass-Serre tree. However, we are 



not able to prove the equivalent of Proposition 3.6 for groups acting acylindrically 
on trees or hyperbolic spaces, and indeed it might not be true. Nonetheless, we 
have the following. 

Proposition 3.7. Let G be a group acting acylindrically (by isometries) on the 
hyperbolic space X and let VS be the collection of all geodesies of X . Then g € 
G is weakly contracting for the weak path system (X.VS) if and only if it acts 
hyperbolically on X. 

Proof. Given the following easy fact, the proof just requires unwinding the defi- 
nitions: if X is hyperbolic then all geodesies in X are contracting with the same 
constant. This is, in other forms, well-known (and it follows from more general 
results in |Sil| ). Here is a proof for the convenience of the reader. Let 7 C X be 
a geodesic, and let 7r : X — > 7 be any map so that d(x,Tr(x)) < d(x,j) + 1 for 
each x £ X. Suppose that, for some x,y, we have d(7r(a:), ir(y)) > 88 + 3, where 
S is the hyperbolicity constant of X, and let a be a geodesic from x to y. As 
quadrangles in X are 28— thin (see, e.g. |Grj). we have that [7r(a;), 7r(y)] is con- 
tained in the 28— neighborhood of a U [x,7r(a;)] U {y,ir(y)]. Due to our choice of 
7r, points [ir(x), n(y)] farther than 45 + 1 from the endpoints are not 28— close to 
[x, tt(x)] U [y, 7r(y)]. Hence, there is a point on a which is 28— close to the point on 
[7r(a;), 7r(y)] at distance 45 + 2 from 7r(a;), and so we have d(a, 7r(x)) < 68 + 2. □ 

Bestvina and Feighn proved that the complex of free factors for Out(F n ) is 
hyperbolic, that the action of Out(F n ) on it satisfies WPD and that g € Out(F n ) 
acts hyperbolically if and only if if it is irreducible with irreducible powers (iwip) 
BFc2j Theorem 8.3]. In particular we have the following. 

Proposition 3.8. Let X be the complex of free factors of Out(F n ) for some n > 3. 
Then g G Out(F n ) is weakly contracting for the weak path system (X,VS) if and 
only it is iwip. 

Remark 3.9. For our applications we could also use the complexes constructed in 
[BFeT] . 
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The present state of knowledge about the geometry of Out(F n ) is not advanced 
enough to attempt imitating what we have done, say, for mapping class groups. It 
is not even known whether there is an algebraic characterization of Morse elements 
along the lines of the other cases, and hence the following is perhaps the most basic 
question arising at this point. 

Question 3.10. Is it true that all Morse elements of Out(F n ) for n > 3 are iwip? 

The converse is true by |A-K] . A standard way to show that an (infinite order) 
element is not Morse is to show that (the cyclic group generated by) it has infinite 
index in its centralizer. There might be infinite order non-iwip elements of Out(F n ) 
which have finite index in their centralizers. So, if the answer to the previous 
question is negative, one might ask the following. 

Question 3.11. Is it true that all elements of Out(F n ) that have finite index in 
their (virtual) centralizers are Morse? 

We remark that even if the answer to the first question is negative it might still 
be possible to apply our methods to find out something about iwip's, using a trick 
similar to the one we used for relatively hyperbolic groups, for example. 

3.5. Groups acting properly on proper CAT(0) spaces. Recall that an isom- 
etry of a CAT(0) space is said to be rank one if it is hyperbolic and some (equiva- 
lent^, every) axis of g does not bound a half- flat. 

Remark 3.12. In the case when G is a right-angled Artin group and X its standard 
CAT(0) cube complex, the elements of G that act as rank 1 isometrics coincide with 
those not conjugated into a join subgroup, see |BC] . 

Proposition 3.13. Let G be a group acting properly by isometries on the proper 
CAT(0) space X and let VS be the collection of all geodesies in X . Then g £ G is 
contracting for the path system (X,VS) if and only if it acts as a rank one isometry. 

Proof. It is clear that (X, VS) is a path system on G, and the "only if" part follows 
from the fact that if g does not act as a rank one isometry then it is not Morse. 
Suppose that g acts as a rank one isometry. By IBFult Theorem 5.4] the closest 
point projection on an axis I of g has the property that there exists B so that each 
ball disjoint from I projects onto a set of diameter at most B. The same is true for 
subsegments of the axis as well, see |BFul[ Lemma 3.2]. Property 2) in Definition 
12.21 then follows from Lemma [2~3l □ 



4. Construction of contracting elements 

Let (X, VS) be a weak path system on the group G. Let xo £ X be any point 
and let 7 be a special path connecting xq to gxo. Consider a projection 7r 7 on 



7 with constant C (see Definition 2.2 1. We do not assume that g is (weakly) 
contracting, but it will always be in applications. For h £ G and x £ X, define 
Tfh-y(x) = hiTj{h~ 1 {x)) (this is clearly a projection on /17 with constant C). Notice 
that for each h\,hi £ G and i,i/£lwe have /ii7r/j 27 (ir) = 7r; ll ; l27 (/ii(a;)) so that 

d(ir h2l (x),y) = d(TT hlh27 (hi(x)),hiy). 

This property will be referred to as equivariance. 

Lemma 4.1. Fix the notation above. There exist K = K(VS, C) with the following 
property. Suppose that h £ G is such that 
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(1) d(x ,gx ) - [d(Tr- f (ghx a ),gx ) + d(ir~ ( (h 1 x ), x )] > K. 

(2) diam(it 1 {gh r ))) < C ', diam(jt gh-yil)) < C. 

Then hg is weakly contracting. In particular, if the action of G on X is proper then 
hg is contracting. 

Proof. Set Xi — (hg) l (x ), x\ = (hg) t ~ 1 h(x ) for each integer i. 

xo hxg = x[ 




Figure 5 



Step 1. Structure of the orbit. We claim that, when K is large enough, any 
special path from Xi to Xj passes uniformly close to each Xk and x' k for min{i, j} + 
1 < k < max{i,j}. This clearly implies that the orbit {xi} of (hg) is a quasi- 
geodesic. It is enough to show the claim for i = 0, and we will assume j > 0, the 
proof for j < being analogous. 

Set di = d(ir~ f (h~ 1 Xf)) J Xq), d 2 ~ d(ir 7 (ghxo), gx^), so that (1) can be rewritten 



as d(xo,gxo) > d\ + d 2 + K. For convenience, we increase C so that Lemma 2.5 
holds for C instead of kC + k + C . We will show by induction on j that 

(*_) d(-K (hg)k -i hl (x ),x' k ) <d ± + 2C, and 

d(ir( hg }k-i hl (xj),Xk) < c_ + 2C for each 1 < k < j. 

This implies our claim by the definition of VS— contracting and the hypothesis 
for K large enough. 

(*i) holds by the definition of d\ (and equivariance), while (**i) holds as x\ E 
hj and (** 2 ) by the definition of e_- Suppose that hold. In view of 

equivariance, in order to show we only need to prove d(ir( hg y hj (xo), x'^ +1 ) < 



di + 2C. We wish to apply Lemma 2.5 with A = (hgY^hj, B = (hgyh^. By Step 
1, diam(-KA(B)) < C. Combining this with d(7r^(a;o), x'j) < d\ + 2C (a special case 
of (*_)), d 2 = d^Aixj+ijiXj) and (1) we get 

d{^A(x ),i: A {B)) > 
d(x'j, Xj) — d(TTA{xo), x'j) — diam(ji a(B)) — d(-KA{xj+\), Xj) > K — 2C. 
So, for K large enough and in view of Lemma |2.5| we get d(7r_,(a;o), ttb(A)) < 
C. As diam(TTB(A)) < C and d(x'j +1 , itb{A)) < d\ by the definition of d\ and 
equivariance, we get the desired inequality. 

Let us now show Using equivariance and the inductive hypothesis we can 

deal with all cases with k > 2, so we only need to show d^h-yixj+i), hgxo) < d 2 + C. 
The proof is very similar to that of (*j), so we will just sketch it. We set A = 7, 
B = hgh-f, and notice that d(itB (A), 7r_j(a;j + i) is large, so that by Lemma |2.5| we 
have that d(itA(xj+i)i ^a{B)) is small. As d(hgx 0l TTA(B)) < d 2 and ita(B) has 
bounded diameter, we are done. 
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Step 2. Contracting property. We have to provide projections with uniform 
constant on any special path connecting x a to x n . Pick such a path S, for some 
n > (the proof for n < is analogous). Let x e X. Define n(x) = irs(x) as 
follows. Set 

m(x) = min{fc < n : d(n {hg) k-i hl (x), n^i.-i h7 (x )) < C}, 

and let n(x) be a point on S at distance at most d\ + C from x' m r x y 

The first property is clear from the previous step, as one can easily reduce to 

check it just for the Xj's. 

Let us now show the second projection property. Suppose that \m(x) — m(y)\ > 

10. Set A = (hg)" 1 ^^ 1 ^, B — (hg)™^^ 1 ^. Up to swapping x and y we can 

assume m(y) > m(x). In particular 

rf ( 7r (hg)™<-)- 1 / l7 ( ;z: ): 7r (/ l g)'"(-)- 1 / l7 (y)) > C, 

so that any special path a from x to y passes close to Trtf lg ^m(,x)-if l j(x), which is by 
the previous step and the definition of tt close to n(x). It can be shown similarly 
that a passes close to 7r(y). □ 

The following variation will be used in the Appendix. 

Lemma 4.2. There exist K = K(VS, C) with the following property. Suppose we 
are given a contracting set A with constant C and g,h 6 G with xg,gxo £ A such 
that 

(1) d(x ,gx ) - [d{TT A (ghx Q ),gxo) + d(Tr A (h~ 1 x ) 1 x )} >K. 

(2) diam(ir A (ghA)) < C, diam(ir ghA (A)) < C. 

Then hg is contracting. 

Proof. Notice that we never used in the proof above that 7 is a special path (we 
just used the properties of a projection on 7). Hence, as we now assume Step 1, 
the proof of Steps 2 and 3 can be followed verbatim to prove the lemma. □ 



5. Contracting implies hyperbolically embedded 

When 7 is a special path and x, y € 7, we will write x < y if for all pre-images 
a;', y' under 7 of x, y we have x' < y' . 

Lemma 5.1 (Projections are coarsely monotone). Let 7, 5 be special paths, assume 
that 7 is a (/1, fx) — quasi-geodesic and let 7r 7 be a projection on 7 with constant C. 
There exists K = K(/j,, C) with the following property. Suppose, for i = 0,1, 2, that 
yi = Tr 7 (xi), for some xi € 5. If y a < yi < y 2 and d{y ,y 1 ),d(yi,y 2 ) > K, then 
either Xq < X\ < x 2 or x 2 < X\ < Xq. 

Proof. Suppose by contradiction that xq < x 2 < x\ (the other cases can be dealt 
with in the same way). Then we can find points x, y <E S lying, respectively, between 
xq and x 2 and between x 2 and x\ such that d{x, y\), d(y, y\) is bounded in terms of 
C (we use Lemma [2.10| -(1) here). Also, d(x 2 ,x),d(x 2 ,y) can be made arbitrarily 
large up to increasing K in view of Lemma |2 .4| — (2) . This gives a contradiction 
with 7 being a quasi-geodesic, see Figure [6j □ 

Let (X, VS) be a weak path system on the group G, and let x € X. 
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Lemma 5.2. Suppose that g £ G is weakly contracting and set Hq = (g) . Denote 
iyjm a special path connecting g~ m xo to g m xo- Then there exists K depending only 
on g such that for each h G G and m G N either ir lm (hHoXo) has diameter bounded 
by K or it is K— dense in 7 m . Moreover, there exists K' so that ifd(xo, HqXq) > K' 
then the first case holds. 

Proof. If Hq is contained in a neighborhood of hHo of finite radius, then it is easily 
seen that the second case holds. We can then find h! € hHo such that h'xo is as 
far as we wish from H xq. We claim that if d(h'x Q , H xq) is large enough, then 
TT lm (h'"f m ) cannot contain points on both sides of Tr~ /m (h'xo) and arbitrarily far 
from it. In fact, such points would be by the previous lemma projections of points 
on opposite sides of h'x' , where x' £ j m is close to xq, and hence one can find 
x,y e h'j m with x < h'xo < y such that d(x, 7r 7m (h'xo)), d(y, ir lm (h'xo)) < C, 
where C is the projection constant. This is easily seen to contradict h'j m being a 
quasi-geodesic, see Figure [7] 

Now, if diam(ir lm (hH )) is large enough we can find (see Figure[8]) a large family 
{Ni} of integers such that 

• 7r 7m(i? JVi ' i '7m) has positive Hausdorff distance from 7r 7m (g Nj h'j m ) when- 
ever i 7^ j. This gives in particular that g Ni h' and g Nj h! do not belong to 
the same left coset of H whenever i ^ j . 

• g Ni h'j m contains a point in some ball of fixed radius around g m °Xo, as well 
as a long subpath contained in a neighborhood of "f m of radius depending 
on the projection constant and the quasi-geodesic constants of special paths 
only. This implies, by Lemma 2.7 that all left cosets g Ni h'Ho contain an 
element from a certain fixed finite subset of G. 

Therefore there is a bound on the cardinality of {Ni}, and hence a bound on 
diam(TT 7m (hHQ)). □ 

One can similarly prove the following. 

Lemma 5.3. Suppose that H < G is not virtually cyclic and it contains the weakly 
contracting element g. Then for each K there exists a left coset hHo Q H °f 
Ho = (g) such that d(hHoXQ, HqXq) > K. 

Proof. As H is not virtually cyclic, it is well-known that it is not quasi-isometric 
to Z. In particular, we can find h! such that h! is as far as we wish from Hq. 
So, either d{h' xq, HqXq) can be made arbitrarily large, and in this case we have 
diam(-K lm {h'^ m )) < K by the "moreover" part of Lemma 5.2 or we can use Lemma 
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2.7 to get that ■K lm {h'^ m ) is not K— dense in 7 m for each m, and hence once again 



diam(TT 7m (h'"f m )) < K by Lemma 5.2 It is then easily seen that we can choose h 
of the form h'g N h' with N large enough so that Tth'i m {lm) is far from iTh'j m (h'fm), 
see Figure [9] □ 

As a consequence of the previous lemmas we get: 

Corollary 5.4. Each weakly contracting element is contained in a virtually cyclic 
subgroup, denoted E(g), such that there exists a uniform bound on ir lm {hE{g)) for 
each h (fi E(g) and m. 

Proof. Just define E(g) to be the collection of all h such that ^^ m {h(g)) has Haus- 
dorff distance from 7 m uniformly bounded in m. This is clearly a subgroup con- 
taining g, and it is virtually cyclic by the previous lemma. The uniform bound is 
a consequence of Lemma |5.2| □ 



We point out some extra properties of E{g). 

Lemma 5.5. There exists a positive integer n such that for each h G E(g) we 
either have hg n h~ 1 = g n or hg n h~ 1 — g~ n . In particular, for each integer k, (g nk ) 
is normal in E(g) and E{g) is the centralizer of g 2n . 

Proof. Choose representatives hi, . . . ,hk of the left cosets of (g) in E(g). Choose 
ni > so that hig 711 ^ 1 €E (<?). Such ni exists because there are finitely many left 
cosets of (g), hence one can find fcj > m, such that g ki h~ x and g mi h~ l belong to 
the same left coset and set ni — ki — mi . Choose n = Y[ n i ■ Then it is readily 
seen that hg n h~ x E (g) for each h e E(g). In particular hg n h~ x = g k for some k, 
and we would like to prove k = ±n. Notice that there exists j such that ]i? e (g) . 
Hence 

g =h J g h = g . 
In particular n J = k J , hence k = ±n, as required. □ 

Theorem 5.6. Let G be any group endowed with a fixed weak path system. For 
each weakly contracting element g 6 G, E(g) is hyperbolically embedded in G. 

We will use work of Bestvina, Bromberg and Fujiwara [BBFj . similarly to the 
proof of |DCOl Theorem 4.39]. We will now give the relevant definitions and state 
the result we need. Let Y be a set and for each Y e Y let C(Y) be a geodesic 
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metric space. For each Y let ny ■ Y\{y} — > V(C(Y)) be a function (where V(Y) 
is the collection of subsets of Y). Define 

d Y (X,Z) =diam{n Y {X)UTT Y (Z)}. 

Using the enumeration in |BBF) . consider the following Axioms: 
(0) diam(TT Y (X)) < +00, 

(3) there exists £ so that mhx{d Y (X, Z), d w z (X, Y)} < £, 

(4) there exists £ so that {Y : d Y (X, Z) > £} is a finite set for each IZeY. 
For suitably chosen constants L, K, let C(Y) be the path metric space consisting 

of the union of all C(Y) and edges connecting all points in ttx(Z) to all points in 
ttz(X) whenever X, Z are connected by an edge in a certain complex Vk (Y) whose 
definition we do not need. 

We are ready to state the (special case of the) result we will use. 

Theorem 5.7. [BBFl Theorem 3.10, Lemma 3.1, Corollary 3.12] IfY and d 7 * sat- 
isfy axioms (0), (3) and (4) and each C(Y) is (A, fi) — quasi-isometric to M. for some 
A, /i not depending on Y , then C(Y) is a quasi-tree (in particular, it is hyperbolic) . 

Moreover, each C(Y) is isometrically embedded in C(Y) and for each K there 
exists R so that 

diam(N K (C(X)) n N K (C(Y))) < R 
whenever 1^7 are elements ofY. 

We will use the following characterization of being hyperbolically embedded. 

Theorem 5.8. DGO, Theorem 4.39] Let G be group and H < G a subgroup. 
Suppose that 

(a) G acts by isometries on a hyperbolic space S, 

(b) there exists a quasi-convex H— orbit, 

(c) {H} is geometrically separated, i.e. for every K > and s £ S there exists R 
so that if for some g £ G we have diam(Hs n N^igHs)) > R then g G H , 

(d) H acts on S properly. 

Then H is hyperbolically embedded in G. 



Proof of Theorem \ 5.6\ For technical reasons, in this proof we will allow pro- 
jections to take values in bounded subsets of the target space. More precisely, we 
consider a slightly generalized definition of the set A being VS— contracting with 
constant C where the projection map it a is allowed to take value in subsets of A 



of diameter bounded by C, while properties 1) and 2) in Definition 2.2 are left 
unchanged. If we have a map it a with these properties we can define a projection 
tt' a in the sense of Definition 2.2 just by choosing some tt' a (x) G ka( x ) f° r each 



x £ X. In particular our results, including Lemma [2. 5| and Corollary |5.4| hold for 
this more general notion (possibly with different constants). 

Let Y be the collection of all left cosets of E(g) in G and for each Y £ Y 
let C(Y) be a copy of E(g) (more precisely, a copy of the Cayley graph of E{g) 
with respect to a given finite set of generators), which we regard as identified 
with E(g)xo, for some given xq. Each orbit hE(g)xo is a contracting set, and the 
constant can be chosen uniformly. Actually, we have more. In fact, there exists 
a collection of equivariant projections on the orbits: choose a projection Tr' e on 
E(g)xo, define n e (x) = U/ig.E( s ) hir' e (h~ l x) and define a projection on hE(g)xo by 
7T/j(x) = hit e (h~ 1 x). In order to show that ir^ is actually a projection we just 
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need to prove that we can uniformly bound diam(TTh(x)) for each x. This follows 
directly from the easily checked fact that there exist constants Di,D 2 so that for 
each special path 7 from x to irh( x ) we have diam(j n Nrj 1 (E(g)xo)) < D 2 . We 
can now define 

7T hEig) (h'E{g)) = h- 1 TT h {h'E(g)x ). 



Axiom (0) follows from Corollary 5.4 while Lemma 2.5 implies Axiom (3). Ax- 
iom (4) is easy: if £ is large enough then 

\{hE(g) : d hE{g) (h 1 E(g),h 2 E(g)) > £}| < d(h 1 E(g)x ,h 2 E(g)x ) 

because all special paths from hiE(g)x$ to h 2 E(g)xo have long disjoint subpaths 
each contained in some hE(g)xo for hE(g) satisfying df lE r g \(hiE(g),h2E(g)) > £. 
As all axioms are satisfied, Theorem |5.7| applies. Notice that all constructions are 



equivariant (including that of Vk(Y)), so that there is a natural action of G on 



C(Y). We wish to show that this action satisfies all properties of Theorem 5.8 with 



H = E{g) and S = C(Y). In fact, in view of Theorem 5.7 C(Y) is hyperbolic. 
Also, there exists an orbit of E(g) which is an isometrically embedded copy of (the 
vertices in a Cayley graph of) E(g) and E{g) acts on it in the natural way, so that 
(b) and (d) are satisfied. Finally, (c) follows from the "moreover" part of Theorem 
1571 □ 

6. Random walks 

Recall that our aim is to prove the following. 

Theorem 6.1. Let G be a group supporting the path system (resp. weak path 
system) (X,VS) and let H < G be a non- elementary finitely generated subgroup 
containing a contracting (resp. weakly contracting) element. Then the probability 
that a simple random walk {X n } supported on H gives rise to a non- contracting 
(resp. non-weakly-contracting) element decays exponentially in the length of the 
random walk. 

Let g 6 H be (weakly) contracting and let 7 m be special paths connecting 
g~ m {x Q ) to 5™(a;o) for each m e N. 

Lemma 6.2. There exists K > 1, n > and < a < 1 with the following 
property. For each m, the probability that d(ir lm (X n x ) , x ) > I is at most Ka l for 
each n > tiq. 

Proof. We will show that 

P[d{ir Jm (X n x ),x ) £ [1,1 + K]} > ^-P[d{Tr lm (X n x ),x ) >l + K], (*) 

A 

for K large enough. This implies the lemma, as one can then show inductively that 

1 



P(cZ(7r 7m (A„zo),xo) >jK)<(l- 



K + 1 



By Lemma 5.4 we can find Kq > and an element h € G such that 7r 7m (/i7 m ) C 
Bk (xq) for each m. We can also assume, up to increasing Kq, that iTh^ m {lm) ( ~ 
Bk q {xq). Let n = \h\. Set d = d(xo,h(xo)) and d' — max{d(xo, sxo) : s G S}, 
where S is the system of generators of H used to construct the random walk. Now, 
let g be a word in S. Suppose that 

d{n lm (gx ),x ) > I + (n + l)(kd' + k)+K + L + l, (*) 
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where L = kC + k + C for k as in Lemma |2.5| Now, write g — <?o9i.92 in such a way 
that 

• 4 f 7m (50^0)7 %o) > I + Kq, and go is the shortest initial sub-word of g with 
this property, 

• \92\ = \h\. 



It is possible to find such g^s in view of (*) and Lemma 2.4 -(2). 

Now, it is easily seen using Lemma |2.5| with A = j m and B = gohj m that, for 
g' = g hgi, we have 

d(TT^ m (g'x ),x a ) e [1,(1 + K + (kd' + k)) + K Q + L + 1] = [1,1 + K x ]. 

There is K2 such that V[X n — g'] > P[X n — g]/K2, and at most K2 words g 
modified according to the procedure above transform into a given g' . We can now 
set K = max{Kf} and we are done. □ 

Here is the last preliminary result, which easily follows from standard facts about 
random walks on non-amenable groups. 

Lemma 6.3. For each I, the probability that {X n } ends up in a point in Bi(e) 
decays exponentially in n. 

Proof. We recall some notation from |Wo] , Let pW(e, y) to be the probability that 
the random walk ends up in y after n steps. The spectral radius of the random 

walk is defined as 

p = limsupp(")(e,y) 1/n , 

where y is any element of H (it does not depend on y). As we point out in the 
Appendix, H contains a free group on two generators and so it is not amenable. 
Hence, [Wol Corollary 12.12] gives p < 1. Consider some p < a < 1. It is quite 
clear from the definitions that for n large enough the probability as in the statement 
satisfies 

p (n) (e,2/)<|A(e)K, 

yeB t (e) 

which is what we were aiming for. □ 
We are now ready to prove the theorem. 

Proof of Theorem \6.1\ With a slight abuse, we will sometimes regard X n as a 
random variable with values in W n (S) instead of G (recall that W n (S) is the set 
of words of length n in elements of S and their inverses, where S is a fixed finite 
generating set for H). Let m be an integer to be determined later. The probability 
that X n does not contain the (weakly) contracting element g m as a subword decays 
exponentially in n, let us write 

for some < (3 < 1 and K m > 1. Let I be any positive real number. In view of the 



previous lemma and Lemma 4.1 we can estimate P[X n is (weakly) contracting] in 
the following way. Suppose that g m C X n , the estimate on the probability of this 
happening is above. For convenience let us assume that g m is a final subword. We 
can do this as V[X n — W1W2W3] = V[X n = W3W1W2] for any words w±, W2, W3. In 
view of Lemma |4.1| applied to g = g m and h the initial subword of X n before g, 
we have that X n is (weakly) contracting if d(ir lm Qixq), Xq)), rf(7r 7m (h~ 1 xo), Xq) are 
both bounded by I (for m large enough depending on I) and d(l, h) is large enough 
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to guarantee, together with the previous condition, that h ^ E(g). (In order to 
apply Lemma |4.1| we first increase the projection constant C to be larger than 



the constant in Lemma 5.2 1. We can estimate the probability that this happens 



using, respectively, Lemma 6.2 applied to X n and X n (we use that the law of the 



random walk is invariant under taking inverses) and Lemma 6.3 To sum up, we 
have constants m(l),no such that, for m > m(l) and n > no, 

P[X n not (weakly) contracting] < K m /3^ + 2Ka l , 

where we suitably increased K m ,/3 m so that K m j3^ bounds from above (for n > 
n(l)) the sum of probabilities that g rn <£. X n and that d(l,h) is smaller than re- 
quired. Notice that this estimate is already good enough to show that the desired 
probability tends to 0, but we want a stronger result. In order to obtain it, no- 
tice that we can "iterate" the estimate, meaning that if the projections fail to be 
controlled, we can repeat the procedure above with h instead of X n , i.e. we can 
estimate the probability that h (regarded as a random walk with fewer steps) does 
not contain g m etc. 

This gives the estimate 

P[X n not (weakly) contracting] < K m f3^ + 2Ka l (K rn f3^ md + 2Ka l ), 

where d is the length of g. We can iterate this j times if n — jmd > no- Iterating 
until we can we get 

P[X n not (weakly) contracting] < P|_(n-n )/(m<J)J > 
where pj is defined inductively in the following way: 

( Pl = 2Ka l , 

\ Pj = K m 0^ d+no + 2Ka l Pj -i for j > 2. 

One can prove inductively that pj < {K m j3 n °)j max{/3 m d, 2Ka l y , so that pj 
decays exponentially, and this completes the proof. □ 

Appendix: Weak Tits Alternative 

In this appendix we sketch the proof of a result implied by Theorem |5.6| and 
results in DGO] (see, e.g., Theorem 2.23), that is to say that subgroups containing 
(weakly) contracting elements are either virtually cyclic or they contain a free group 
on two generators. We do so, as explained in the introduction, to show that our 
techniques can prove Theorcm|1.4|in an almost self-contained way. 



Lemma 6.4. Let gi,<?2 be weakly contracting elements and let "fi. m be a special 
path connecting g~ m xo to g™xo- Suppose 

sup{diam(7r 7lij (7 2i fe)),(iiam(7r 72ij (7i ife ))} < +oo. 

Then there exists no such that, for each \n\, \m\ > no, g n and h m form a basis of a 
free subgroup of G. 

Proof. [Sketch] Set Y = [J g eG jenid^J U 9l2,j) Q X. Clearly, Y is G— invariant. 

Let TTi : Y — > {—N, . . . , N} be such that d(g^ xq, 7r 7 . n2 (x)) is minimal for each 

x e Y. Set Af = {y e Y : i^i{y) > 0}. We just have to verify the hypotheses 
of the Ping Pong Lemma. Let us show, for n large enough compared to N, that 
g™(Af, x U Af) C Af, the analogous fact for g^ n admitting the same proof. Let 
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us start by showing that g^(Af) C Af . Let x € and consider a special path 5 
connecting x to gf 7T ' . We have (up to taking n large enough) that gfS cannot pass 
close to {gf k xo}k<o, for otherwise 5 would do the sam e, and this forces ni(gfx) > 0. 



Let us now show gf(Af +1 ) C Af. By Lemma 2.5 and N large enough, we have 
7Ti(A i+1 ) = 0, so that a special path from x G A i+1 to 7r 7 . n2 (a;) will stay far from 
each gf fk for k ^ 0. Hence, an argument very similar to the one above gives that 

*i(g?x) > o. □ 

Theorem 6.5 (Weak Tits Alternative). Let G be a finitely generated group equipped 
with the weak path system (X,VS). Let H < G be a subgroup containing a weakly 
VS— contracting element. Then either H is virtually cyclic or it contains a free 
group on two generators. 

The theorem above was known already in many cases. For relatively hyperbolic 
groups it essentially follows from the proofs in [Xij . for mapping class groups see 
[EBlMc], for Out(F n ) see |TKL] (the "full" Tits Alternative is also known |BFH1| 
IBFH2| L for right-angled Artin groups it follows from Tits' original result |Tij in 
view of the fact that they are linear |HW| IDJj (and standard facts about CAT(0) 
spaces), and for graph manifold groups one just needs to use some Bass-Serre theory. 

Proof. [Sketch] Let g G H be a weak VS— contracting element and let x G X. 
If H is not virtually cyclic then we can find h G H with h ^ E(g) (see Corollary 



5.4). We can use Lemma 4.1 to show that hg n is weakly contracting for some large 
enough n. 

Now, let 7 m (resp. 5 m ) be a special path connecting xq to g mn xo (resp. xq 
to (hg n ) m xo). We wish to find a uniform bound on the diameters of 7r 7 .(5fe) and 



ns-ilk), so that Lemma 6.4 gives us the wished free group. Actually, it is enough 
to bound just the diameter Dj t k of, say, ■n lj {5k), and the bound for irsA'Jk) will 
follow easily. 

If Djk is large enough then we would be able to find a bound on d(hxo, (g)xo) 
depending on g only, as in this case a long subpath of 5k would travel parallel to 
jj. We can assume that such bound does not hold. □ 

We point out a consequence of the Weak Tits Alternative. The following result 
is implicit in |Xi| . 

Corollary 6.6. [Relative Tits Alternative] Let G be a torsion-free relatively hyper- 
bolic group. Any H < G either is virtually cyclic, it contains a free group on two 
generators or it is parabolic (i.e. contained in a conjugate of a peripheral subgroup). 

In particular, G as above satisfies the Tits alternative if and only if its peripheral 
subgroups do. 



Proof. In view of the Weak Tits Alternative and Proposition |3.1| we only need to 
show that if H < G does not contain a hyperbolic element then it is parabolic. 
This is an easy consequence of the following lemma. 

Lemma 6.7. If H\,H2 are peripheral subgroups, g G Hi, h = kh'k^ 1 G fc-f^fc -1 
for some k G G, Hi ^ kH 2 and d(e, g), d{e, hf) are large enough compared to d{e, k), 
then gh is hyperbolic. 
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Proof. We will use the path system (X,VS) constructed in Proposition 3.1 and 



apply Lemma 4.2 with A — Hi. By |Silj . the projection of H on H', whenever 
H =/= H' are left cosets of peripheral subgroups, is uniformly bounded by, say, 
C, and a similar statement holds true in X. Also, we can assume h ^ H\ as 
diam(Hir\NM XOi kx )(kH2)) can be bounded in terms of d(xo, kxo) by DS, Theorem 



4.1— (ai)]. In particular ghH\ ^ H\ so that condition 2) of Lemma 4.2 is satisfied. 
Indeed, condition 1) is also satisfied as d{~KHx{ghxo), gxo) can be bounded in terms 
of d(xo, kxo), as well as d^ir^ {h xq), xq). □ 

In order to conclude the proof notice if H < G contains parabolic elements h 
not contained in the same conjugate of some peripheral subgroup, up to conjugating 
both of them and taking high enough powers we can reduce to the situation of the 
lemma (as G is torsion free), and hence we have that H contains a hyperbolic 
element. □ 

Remark 6.8. Notice that the lemma in the proof above implies results by Osin 
jOs2j on the existence of hyperbolic elements. 
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